34 research outputs found

    Competitive function approximation for reinforcement learning

    The application of reinforcement learning to problems with continuous domains requires representing the value function by means of function approximation. We identify two aspects of reinforcement learning that make the function approximation process hard: non-stationarity of the target function and biased sampling. Non-stationarity is the result of the bootstrapping nature of dynamic programming, where the value function is estimated using its own current approximation. Biased sampling occurs when some regions of the state space are visited too often, causing repeated updates with similar values that drown out the occasional updates of infrequently sampled regions. We propose a competitive approach to function approximation in which many different local approximators are available at a given input, and the one expected to give the best approximation is selected by means of a relevance function. The local nature of the approximators allows fast adaptation to non-stationary changes and mitigates the biased-sampling problem. The coexistence of multiple approximators, updated and tried in parallel, permits obtaining a good estimation much faster than would be possible with a single approximator. Experiments on different benchmark problems show that the competitive strategy provides faster and more stable learning than non-competitive approaches.
    Preprint
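
    To make the selection mechanism concrete, here is a minimal Python sketch of the competitive scheme under our own assumptions: the Gaussian relevance, the constant local model, and all names are illustrative stand-ins, not the paper's definitions.

    import numpy as np

    class LocalApproximator:
        def __init__(self, center, width, lr=0.1):
            self.center = np.asarray(center, dtype=float)
            self.width = width    # size of the region this model covers
            self.lr = lr          # local learning rate
            self.value = 0.0      # simplest local model: a constant estimate

        def relevance(self, x):
            # Hypothetical relevance: proximity of x to this model's region.
            d2 = np.sum((np.asarray(x, dtype=float) - self.center) ** 2)
            return np.exp(-d2 / (2.0 * self.width ** 2))

        def update(self, target):
            # Purely local update: adapts quickly when the target drifts.
            self.value += self.lr * (target - self.value)

    def predict(approximators, x):
        # Competition: answer with the approximator most relevant at x.
        return max(approximators, key=lambda a: a.relevance(x)).value

    def learn(approximators, x, target):
        # All sufficiently relevant approximators are updated in parallel,
        # so several candidate estimates mature at once.
        for a in approximators:
            if a.relevance(x) > 1e-2:
                a.update(target)

    Because each update touches only the models near x, frequent visits to one region cannot overwrite estimates held elsewhere, which is the intuition behind the mitigation of biased sampling.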

    Stochastic approximations of average values using proportions of samples

    IRI Technical Report. In this work we explain how the stochastic approximation of the average of a random variable is carried out when the observations used in the updates consist of proportions of samples rather than complete samples.
    Preprint
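
    As a worked illustration of the setting (our reading of the abstract, not necessarily the report's exact rule): the standard stochastic approximation of an average corrects the estimate with a complete sample,

    \[ \hat{\mu}_{t+1} = \hat{\mu}_t + \alpha_t \left( x_t - \hat{\mu}_t \right), \]

    and one plausible way to use only a proportion \( p_t \in (0, 1] \) of a sample is to scale the correction accordingly,

    \[ \hat{\mu}_{t+1} = \hat{\mu}_t + \alpha_t \, p_t \left( x_t - \hat{\mu}_t \right), \]

    so that partial observations contribute proportionally smaller steps. The technical report should be consulted for the actual update.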

    A competitive strategy for function approximation in Q-learning

    In this work we propose an approach for generalization in continuous-domain Reinforcement Learning that, instead of using a single function approximator, tries many different function approximators in parallel, each one defined in a different region of the domain. Associated with each approximator is a relevance function that locally quantifies the quality of its approximation, so that, at each input point, the approximator with the highest relevance can be selected. The relevance function is defined using parametric estimations of the variance of the q-values and of the density of samples in the input space, which quantify the accuracy of, and the confidence in, the approximation, respectively. These parametric estimations are obtained from a probability density distribution represented as a Gaussian Mixture Model embedded in the input-output space of each approximator. In our experiments, the proposed approach required fewer experiences for learning and produced more stable convergence profiles than a single function approximator.
    Peer Reviewed. Preprint
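
    A hedged sketch of how such a relevance could be computed from a fitted Gaussian Mixture Model; the function names and the final combination rule are our illustrative assumptions, the paper defines its own.

    import numpy as np
    from sklearn.mixture import GaussianMixture

    def fit_joint_gmm(X, q, n_components=3, seed=0):
        # Embed the samples in the joint input-output space of one approximator.
        data = np.column_stack([X, q])
        return GaussianMixture(n_components=n_components, random_state=seed).fit(data)

    def relevance(gmm, x, dim_x):
        x = np.asarray(x, dtype=float)
        density = 0.0
        weighted_var = 0.0
        for k in range(gmm.n_components):
            mu_x = gmm.means_[k, :dim_x]
            S = gmm.covariances_[k]
            Sxx, Sxq, Sqq = S[:dim_x, :dim_x], S[:dim_x, dim_x:], S[dim_x:, dim_x:]
            inv_Sxx = np.linalg.inv(Sxx)
            # Confidence term: marginal input density under component k.
            diff = x - mu_x
            norm = np.sqrt((2 * np.pi) ** dim_x * np.linalg.det(Sxx))
            dens_k = gmm.weights_[k] * np.exp(-0.5 * diff @ inv_Sxx @ diff) / norm
            # Accuracy term: conditional variance of q given x under component k.
            var_k = (Sqq - Sxq.T @ inv_Sxx @ Sxq).item()
            density += dens_k
            weighted_var += dens_k * var_k
        if density <= 0.0:
            return 0.0
        cond_var = weighted_var / density
        # One plausible combination: dense data and low q-variance => high relevance.
        return density / (cond_var + 1e-6)

    Here the marginal input density plays the role of confidence and the conditional variance of q given x the role of (inverse) accuracy; combining them as a ratio is just one plausible choice.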

    Efficient interactive decision-making framework for robotic applications

    This manuscript version is made available under the CC-BY-NC-ND 4.0 license: http://creativecommons.org/licenses/by-nc-nd/4.0/
    The inclusion of robots, such as service robots, in our society is imminent. Robots are now capable of reliably manipulating objects in our daily lives, but only when combined with artificial intelligence (AI) techniques for planning and decision-making, which allow a machine to determine how a task can be completed successfully. To perform decision making, AI planning methods use a set of planning operators to encode the state changes in the environment produced by a robotic action. Given a specific goal, the planner then searches for the best sequence of planning operators, i.e., the best plan that leads through the state space to satisfy the goal. In principle, planning operators can be hand-coded, but this is impractical for applications that involve many possible state transitions. An alternative is to learn them automatically from experience, which is most efficient when there is a human teacher. In this study, we propose a simple and efficient decision-making framework for this purpose. The robot executes its plan in a step-wise manner, and any planning impasse produced by missing operators is resolved online by asking a human teacher for the next action to execute. Based on the observed state transitions, this approach rapidly generates the missing operators by evaluating the relevance of several cause-effect alternatives in parallel using a probability estimate, which compensates for the high uncertainty inherent in learning from a small number of samples. We evaluated the validity of our approach in simulated and real environments, where it was benchmarked against previous methods. Humans learn in the same incremental manner, so we consider that our approach may be a better alternative to existing learning paradigms, which require offline learning, a significant amount of previous knowledge, or a large number of samples.
    Peer Reviewed. Postprint (author's final draft)
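
    The control flow of such a framework can be sketched in a few lines of Python; the planner, world, teacher, and goal objects are assumed stubs and the method names are illustrative, but the loop (plan step-wise, ask the teacher at an impasse, learn the missing operator from the observed transition) follows the description above.

    def run_task(planner, world, teacher, goal):
        # Step-wise execution: plan one action at a time from the current state.
        state = world.observe()
        while not goal.satisfied(state):
            action = planner.next_action(state, goal)
            if action is None:
                # Planning impasse: no known operator applies, so the
                # framework asks the human teacher for the next action.
                action = teacher.suggest_action(state, goal)
            next_state = world.execute(action)
            if planner.lacks_operator(state, action, next_state):
                # Generate the missing operator online from the observed
                # transition, weighing several cause-effect alternatives.
                planner.learn_operator(state, action, next_state)
            state = next_state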

    A general strategy for interactive decision-making in robotic platforms

    This work presents an integrated strategy for planning and learning suitable for executing tasks with robotic platforms without any previous task specification. The approach rapidly learns planning operators from few action experiences using a competitive strategy in which many alternative cause-effect explanations are evaluated in parallel, and the most successful ones are used to generate the operators. The system operates without task interruption by integrating into the planning-learning loop a human teacher who supports the planner in making decisions. All the mechanisms are integrated and synchronized in the robot using a general decision-making framework.
    Preprint
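
    A minimal sketch of the competition between cause-effect explanations, under our own assumptions: the representation and the smoothed score below are illustrative stand-ins for the papers' probabilistic estimate.

    from dataclasses import dataclass

    @dataclass(frozen=True)
    class CauseEffect:
        precondition: frozenset   # facts hypothesized to trigger the effect
        effect: frozenset         # state change the explanation predicts
        successes: int = 0        # times the prediction came true
        trials: int = 0           # times the precondition held and was tested

        def score(self) -> float:
            # Smoothed success rate (cf. the estimate sketched further below),
            # so that barely tested alternatives are not trusted too much.
            return (self.successes + 1) / (self.trials + 2)

    def promote_to_operators(alternatives, top_k=1):
        # Competition: all alternatives are scored in parallel and only
        # the most successful ones become planning operators.
        return sorted(alternatives, key=lambda h: h.score(), reverse=True)[:top_k]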

    Integrating task planning and interactive learning for robots to work in human environments

    Human environments are challenging for robots, which need to be trainable by lay people and to learn new behaviours rapidly without greatly disrupting the ongoing activity. A system that integrates AI techniques for planning and learning is proposed here to satisfy these strong demands. The approach rapidly learns planning operators from few action experiences using a competitive strategy in which many alternative cause-effect explanations are evaluated in parallel, and the most successful ones are used to generate the operators. The success of a cause-effect explanation is evaluated by a probabilistic estimate that compensates for the lack of experience, producing more confident estimations and speeding up learning relative to other known estimates. The system operates without task interruption by integrating into the planning-learning loop a human teacher who supports the planner in making decisions. All the mechanisms are integrated and synchronized in the robot using a general decision-making framework. The feasibility and scalability of the architecture are evaluated on two different robot platforms: a Stäubli arm and the humanoid ARMAR III.
    Peer Reviewed. Postprint (author's final draft)
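
    The abstracts do not spell out the estimate, so as a classical stand-in with the described behaviour, consider the rule-of-succession (Laplace-smoothed) estimate of the success probability of an explanation with s successes in n trials:

    \[ \hat{p} = \frac{s + 1}{n + 2} \]

    For small n this shrinks toward 1/2 instead of committing to the raw frequency s/n: one success in one trial gives 2/3 rather than 1, which is the kind of compensation for scarce experience the text refers to. The papers' own estimate may differ.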

    Quick learning of cause-effects relevant for robot action

    In this work we propose a new paradigm for the rapid learning of cause-effect relations relevant for task execution. Learning occurs automatically from action experiences by means of a novel constructive learning approach designed for applications where there is no previous knowledge of the task or world model, examples are provided online at run time, and the number of examples is small compared to the number of incoming experiences. These limitations pose obstacles for existing constructive learning methods, in which online learning is either not considered, a significant amount of prior knowledge has to be provided, or a large number of experiences or training streams is required. The system is implemented and evaluated on a humanoid robot platform using a decision-making framework that integrates a planner, the proposed learning mechanism, and a human teacher who supports the planner in the action selection. Results demonstrate the feasibility of the system for decision making in robotic applications.
    Preprint

    On-line learning of macro planning operators using probabilistic estimations of cause-effects

    In this work we propose an online method for learning action rules for planning. The system uses a probabilistic approach to constructive induction that combines a beam search with an example-based search over candidate rules to find those that most concisely describe the world dynamics. The approach permits a rapid integration of the knowledge acquired from experience. Exploration of the world dynamics is guided by the planner and, if the planner fails because of incomplete knowledge, by a teacher through action instructions.
    Preprint
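
    A generic beam-search skeleton consistent with this description; the rule representation, the refinement step, and the example-based scoring are left as parameters, and all names are illustrative.

    def beam_search(initial_rules, refine, score, beam_width=5, depth=3):
        # score(rule): example-based quality, e.g. how concisely the rule
        # describes the observed world dynamics (higher is better).
        # refine(rule): candidate specializations/generalizations of a rule.
        # Rules must be hashable so duplicates can be discarded.
        beam = sorted(initial_rules, key=score, reverse=True)[:beam_width]
        for _ in range(depth):
            candidates = list(beam)
            for rule in beam:
                candidates.extend(refine(rule))
            beam = sorted(set(candidates), key=score, reverse=True)[:beam_width]
        return beam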

    La renovación de la palabra en el bicentenario de la Argentina: los colores de la mirada lingüística

    The book brings together papers presenting the results of research by investigators from Argentina, Chile, Brazil, Spain, Italy, and Germany at the XII Congreso de la Sociedad Argentina de Lingüística (SAL), Bicentenario: la renovación de la palabra, held in Mendoza, Argentina, from 6 to 9 April 2010. The topics addressed in the 167 chapters show the main lines of research pursued chiefly in our country, but also in the other countries mentioned above, and they also point to areas that are just getting started, with little tradition in our country, which ought to be fostered. The papers published here fall within the following disciplines and/or fields of research: Phonology, Syntax, Semantics and Pragmatics, Cognitive Linguistics, Discourse Analysis, Psycholinguistics, Language Acquisition, Sociolinguistics and Dialectology, Language Teaching, Applied Linguistics, Computational Linguistics, History of Language and Linguistics, Indigenous Languages, Philosophy of Language, Lexicology and Terminology.
